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Strain T5 T is the type strain of the species Phaeobacter inhibens Martens et al. 2006, a sec- 
ondary metabolite producing bacterium affiliated to the Roseobacter clade. Strain T5 T was 
isolated from a water sample taken at the German Wadden Sea, southern North Sea. Here 
we describe the complete genome sequence and annotation of this bacterium with a spe- 
cial focus on the secondary metabolism and compare it with the genomes of the 
Phaeobacter inhibens strains DSM 17395 and DSM 24588 (2.10), selected because of the 
close phylogenetic relationship based on the 1 6S rRNA gene sequences of these three 
strains. The genome of strai n T5 T comprises 4, 1 30,897 bp with 3.92 3 protein-coding genes 
and shows high similarities in genetic and genomic characteristics compared to P. inhibens 
DSM 1 7395 and DSM 24588 (2.1 0). Besides the chromosome, strain T5 T possesses four 
plasmids, three of which show a high similarity to the plasmids of the strai ns DSM 1 7395 
and DSM 24588 (2.10). Analysis of the fourth plasmid suggested horizontal gene transfer. 
Most of the genes on this plasmid are not present in the strai ns DSM 1 7395 and DSM 
24588 (2.10) including a nitrous oxide reductase, which allows strain T5 T a facultative an- 
aerobic lifestyle. The G+C content was calculated from the genome sequence and differs 
significantly from the previously published value, thus warranting an emendation of the 
species description. 



BY-SA 



The Genomic Standards Consortium 



Dogs et al. 



Introduction 

Strain T5 T was isolated from a water sample tak- 
en on 25 th of October 1999 above an intertidal 
mud flat of the German Wadden Sea (53°42'20"N, 
07°43'11"E) and found to be closely related to 
the type strain of Roseobacter gallaeciensis [1]. 
Two years later Martens et al. (2006) reclassified 
Roseobacter gallaeciensis as Phaeobacter 
gallaeciensis and described strain T5 T as type 
strain of the species Phaeobacter inhibens. As 
found for various Phaeobacter strains [2-7], P. 
inhibens strain T5t (= DSM 163 74t = LMG 22475t 
= CIP 1092 89 T ) is able to produce the antibiotic 
tropodithietic acid (TDA) [8]. Furthermore, 
strains of P. gallaeciensis and P. inhibens, includ- 
ing strain T5 T , are able to produce a brownish 
pigment, which is the basis of the genus name 
[phaeos - dark, brown) [1]. The epithet of the 
species name points to the strong inhibitory ac- 
tivity of P. inhibens against different taxa of ma- 
rine bacteria and algae [1]. The genus 
Phaeobacter is known to have a high potential for 
secondary metabolite production, as indicated by 
biosynthesis of TDA and iV-acyl homoserine lac- 
tones (AHLs), as well as presence of genes coding 
for polyketide synthases (PKS) and nonribosomal 
peptide synthetases (NRPS) [2,7-10]. Biosynthe- 
sis of many different bioactive natural products 
is mediated by PKSs or NRPSs, including antibiot- 
ics, toxins and siderophores. Moreover, produc- 
tion of volatile compounds is widespread over 
the Roseobacter clade. It displays a particularly 
high proportion of volatile sulfur-containing 
compounds and thus seems to play an important 
role in the sulfur cycle of the ocean [11]. The sul- 
fur-containing TDA, for which the sulfur precur- 
sor has not yet been determined, plays an im- 
portant role in the mutualistic symbioses of P. 
inhibens and marine algae [12]. p-Coumaric acid 
causes the organism to switch from a state of mu- 
tualistic symbiosis to a pathogenic lifestyle in 
which toxicity is mediated via the production of 
the algicidal roseobacticides, which, like p- 
coumaric, is also a sulfur-containing metabolite 
[13,14]. 

Here we present the genome of P. inhibens strain 
T5 T with particular emphasis on the genes in- 
volved in secondary metabolism and comparison 
with the recently published genomes of the P. 
inhibens strains DSM 17395 and DSM 24588 
(2.10) [3]. DSM 17395 and DSM 24588, originally 



deposited as P. gallaeciensis strains, were recent- 
ly reclassified as P. inhibens [15]. 

Classification and features 
1 6S rRNA gene analysis 

Figure 1 shows the phylogenetic neighborhood of 
P. inhibens DSM 16374 T in a tree based on 16S 
rRNA genes. The sequences of the three identical 
16S rRNA gene copies differ by one nucleotide 
from the previously published 16S rRNA sequence 
(NCBI Accession No. AY177712). 

A representative genomic 16S rRNA gene se- 
quence of P. inhibens DSM 16374 T was compared 
using NCBI BLAST [16,17] under default settings 
(e.g., considering only the high-scoring segment 
pairs (HSPs) from the best 250 hits) with the 
most recent release of the Greengenes database 
[18] and the relative frequencies of taxa and 
keywords (reduced to their stem [19]) were de- 
termined, weighted by BLAST scores. The most 
frequently occurring genera were Ruegeria 
(32.5%), Phaeobacter (28.8%), Silicibacter 
(13.6%), Roseobacter (13.3%) and Nautella 
(3.5%) (141 hits in total). Regarding the single 
hit to sequences from the species, the average 
identity within HSPs was 99.8%, whereas the av- 
erage coverage by HSPs was 99.3%. Regarding 
the nine hits to sequences from other species of 
the genus, the average identity within HSPs was 
99.0%, whereas the average coverage by HSPs 
was 99.2%. Among all other species, the one 
yielding the highest score was P. gallaeciensis 
(NZ_ABIF01000004), which corresponded to an 
identity of 100.0% and an HSP coverage of 
100.0%. (Note that the Greengenes database uses 
the INSDC (= EMBL/NCBI /DDBJ) annotation, 
which is not an authoritative source for nomen- 
clature or classification). The highest- scoring en- 
vironmental sequence was AJ296158 
(Greengenes short name 'SpaimGalicia isolate str. 
PP-154'), which showed an identity of 99.8% and 
an HSP coverage of 100.0%. The most frequently 
occurring keywords within the labels of all envi- 
ronmental samples which yielded hits were 
'microbi' (3.1%), 'marine' (2.6%), 'coral' (2.3%), 
'biofilm' (2.1%) and 'membrane, structure, swro' 
(1.8%) (100 hits in total). Environmental sam- 
ples which yielded hits of a higher score than the 
highest scoring species were not found. 
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Morphology and physiology 

Cells of T5 T are ovoid rods, 1.4-1.9 x 0.6-0.8 |im 
(Figure 2). Furthermore, T5 T cells show the typical 
multicellular star-shaped structure described pre- 
viously for P. gallaeciensis and other Roseobacter- 
clade organisms [2,4,47] (Figure 2). Cells of T5 T 
are motile by means of a polar flagellum. T5 T is a 
Gram- negative, marine, facultatively anaerobic, 
mesophilic bacterium with an optimal growth 
temperature between 27 and 29 °C and an optimal 
salinity between 0.51 and 0.68 M. The pH range 
for growth is 6.0 - 9.5, with an optimum at 7.5. On 
marine agar T5 T forms smooth and convex colo- 
nies with regular edges and brownish pigmenta- 
tion on ferric citrate containing media. T5 T utilizes 
pentoses, hexoses, di saccharides and most amino 
acids as carbon and energy sources. No vitamin 
requirements were observed [1]. 



Chemotaxonomy 

There are no significant differences between the 
fatty-acid profile of strain T5 T and other repre- 
sentatives of the Roseobacter clade [1]. Strain T5 T 
has the highest profile similarity to P. gallaeciensis 
CIP 105210 T [1]. The principal cellular fatty acids 
of strain T5 T are the following saturated 
branched-chain fatty acids: Ci8:i U 7c (73.77%), 11- 
methyl Ci 8: i U 7c (7.45%), Ci 6:0 (3.83%), Ci 8:0 
(3.14%), 2-OH Cim (3.10%), &« (2.19%), 3-OH 
Cio:o (1.71%), 3-OH Ci2:o (1.59%), 3-OH Cu:i/ 3 
oxo-Ci4:o (0.87%), Ci8.i U 9c (0.76%) and an unam- 
biguously identified fatty acid (1.59%) [1]. The 
major polar lipids of strain T5 T comprise 
phosphatidylglycerol, phosphatidylethanolamine, 
phosphatidylcholine, an aminolipid and two uni- 
dentified lipids [1]. 



Ruegena marina (FJ872535) 

■ Ruegena pomeroyi (AF098491) " 
Ruegena halocynthiae (HQ852038) 



- Ruegena aVantica (D88526) 

Ruegena lacuscaerulensis (U77644) ' 



- Ruegena mobilis (AB255401) 



Ruegeria scottomollicae (AM905330) 



Ruegena faecimaris (GU057915) 

— Leisingera methylohalidivorans (AY005463) " 

62/65 

Leisingera aquimanna (AM900415) ' 



- Leisingera nanhaiensis (FJ232451) * 



- Phaeobacter caeruleus (AM943630) ' 

— Phaeobacter daeponensis (DQ981486) * 
Phaeobacter arcticus (DQ514304) 



Phaeobacter inhibens (IMG2517045822) * 
Phaeobacter gallaeciensis (Y13244) " 



Figure 1. Phylogenetic tree highlighting the position of P. inhibens relative to the type strains of the other species with- 
in the genus Phaeobacter and the neighboring genera Leisingera and Ruegeria [1,20-33]. The tree was inferred from 
1,385 aligned characters [34,35] of the 16S rRNAgene sequence under the maximum likelihood (ML) criterion [36]. 
Rooting was done initially using the midpoint method [37] and then checked for its agreement with the current classi- 
fication (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent 
to the branches are support values from 1,000 ML bootstrap replicates [38] (left) and from 1,000 maximum-parsimony 
bootstrap replicates [39] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in 
GOLD [40] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [21]. The 
genomes of six more Leisingera and Phaeobacter species are published in the current issue of Standard in Genomic 
Science [41-46]. The 1 6S rRNA sequences of P. inhibens strain DSM 24588 and P. inhibens strain DSM 1 7395 are vir- 
tually identical to those of P. inhibens DSM 1 6374' (data not shown). 
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Figure 2. Scanning electron microscope pictures of P. inhibens strain DSM 16374 T showing (a) the 
typical cell ovoid shape of strain T5 T and (b) the multicellular, star-shaped structure as described 
previously for Phaeobacter and further /?oseo6acfer-clade organisms. 



Table 1. Classification and general features of P. inhibens T5 T according to the MIGS recommendations [48]. 



MIGS ID 


Property 


Term 


Evidence code 






Domain Bacteria 


TAS [49] 






Phylum Proteobacteria 


TAS [50] 






Class Alphaproteobacteria 


TAS [51,52] 




Current classification 


Order Rhodobacterales 


TAS [52] 






Family Rhodobacteraceae 


TAS [52,53] 






Genus Phaeobacter 


TAS [1 ] 






Species Phaeobacter inhibens 


TAS [1] 






Type strain T5 


TAS [1,8] 




f"i {A m sta i n 

* — 1 1 Ol 1 1 1 3 LCI 1 1 1 


nppativp 

1 ice all V c 


TAS [1] 




Cell shape 


rod shaped 


TAS [1] 




Motility 


motile 


TAS [1] 




Sporulation 


none 


TAS [1] 


MICS-6.1 


Temperature range 


mesophile 


TAS [1] 


MICS-6.1 


Optimum temperature 


2 7-29°C 


TAS [1] 


MIGS-6.3 


Salinity 


0.01- <1.5M NaCl 


TAS [1] 


MIGS-22 


Oxygen requirement 


facultative anaerobic 


IDA 




Carbon source 


oligosaccharides, sugar alcohols, organic acids, amino acids 


TAS [1] 




Energy metabolism 


heterotrophic 


TAS [1] 


MIGS-6 


Habitat 


marine 


TAS [1] 


MIGS-15 


Biotic relationship 


unknown 


NAS 


MIGS-14 


Pathogenicity 


none 


TAS [1] 




Biosafety level 


1 


TAS [54] 


MIGS-23 


Isolation 


water sample above an intertidal mud flat 


TAS [1,8] 


MIGS-4 


Geographic location 


German Wadden Sea 


TAS [1,8] 


MIGS-5 


Sample collection time 


October 2 5, 1999 


TAS [1,8] 


MIGS-4. 1 


Latitude 


53°42'20"N 


TAS [1,8] 


MIGS-4.2 


Longitude 


07°43'11"E 


TAS [1,8] 


MIGS-4. 3 


Depth 


Above sea ground 


TAS [1,8] 


MIGS-4. 4 


Altitude 


unknown 





Evidence codes - IDA: Inferred from direct assay; TAS: Traceable Author Statement (i.e., a direct report exists in the lit- 
erature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a 
generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology pro- 
ject [55]. 
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Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the 
basis of the DOE Joint Genome Institute Community 
Sequencing Program (CSP) 2010, CSP 441 "Whole 
genome type strain sequences of the genera 
Phaeobacter and Leisingera - a monophyletic group 
of physiological highly diverse organisms". The ge- 
nome project is deposited in the Genomes On Line 



Database [40] and the complete genome sequence 
is deposited in GenBank and the Integrated Micro- 
bial Genomes database (IMG) [56]. Sequencing, fin- 
ishing and annotation were performed by the DOE 
Joint Genome Institute QGI) using state of the art 
sequencing technology [57]. A summary of the pro- 
ject information is shown in Table 2. 



Table 2. Genome sequencing project information 



MIGS ID Property 



Term 



MIGS-31 Finishing quality 
MIGS-28 Libraries used 
MIGS-29 Sequencing platforms 
MIGS-31. 2 Sequencing coverage 
MIGS-30 Assemblers 
MIGS-32 Gene calling method 
INSDC ID 

GenBank Date of Release 
GOLD ID 
NCBI project ID 
Database: IMG-GEBA 
MIGS-13 Source material identifier 
Project relevance 



Permanent draft 

Two lllumina paired-end libraries (225 bp and 9kb insert size) 
lllumina GAii, PacBio 
1,111 x lllumina 

Allpaths, Velvet 1 .1 .05, phrap version SPS - 4.24 

Prodigal 1.4, GenePRIMP 

AXBB00000000 

September 30, 201 3 

Gi10860 

881 11 

2516653078 
DSM 16374 

Tree of Life, carbon cycle, sulfur cycle, environmental 



Growth conditions and DNA isolation 

A culture of DSM 16374 T was grown aerobically in 
DSMZ medium 514 [58] at25°C. Genomic DNA was 
isolated using the Jetflex Genomic DNA Purification 
Kit (GENOMED 600100) following the standard 
protocol provided by the manufacturer but modi- 
fied by an incubation time of 40 min, the incubation 
on ice over night on a shaker, the use of additional 
10 u.1 proteinase K, and the addition of 100 u.1 pro- 
tein precipitation buffer. DNA is available from 
DSMZ through the DNA Bank Network [59]. 

Genome sequencing and assembly 

For this genome, we constructed and sequenced an 
lllumina short-insert paired-end library with an 
average insert size of 225 bp, and an lllumina long- 
insert paired-end library with an average insert 
size of 9602 bp, which generated 18,471,132 reads 
and 11,906,846 reads, respectively, totaling 4,557 
Mbp of lllumina data. All general aspects of library 
construction and sequencing performed can be 
found at the JGI website [60]. The initial draft as- 
sembly contained 13 contigs in 10 scaffold. The ini- 
tial draft data was assembled with Allpaths [61] 
and the consensus was computationally shredded 



into 10 kbp overlapping fake reads (shreds). The 
lllumina draft data was also assembled with Velvet 
[62], and the consensus sequences were computa- 
tionally shredded into 1.5 kbp overlapping fake 
reads (shreds). The lllumina draft data was assem- 
bled again with Velvet using the shreds from the 
first Velvet assembly to guide the next assembly. 
The consensus from the second Velvet assembly 
was shredded into 1.5 kbp overlapping fake reads. 
The fake reads from the Allpaths assembly and 
both Velvet assemblies and a subset of the lllumina 
CLIP paired-end reads were assembled using paral- 
lel phrap (High Performance Software, LLC) [63]. 
Possible mis-assemblies were corrected with man- 
ual editing in Consed [63]. Gap closure was accom- 
plished using repeat resolution software (Wei Gu, 
unpublished), and sequencing of bridging PCR 
fragments with PacBio technologies. A total of 10 
PCR PacBio consensus sequences were completed 
to close gaps and to raise the quality of the final 
sequence. The final assembly is based on 4,557 
Mbp of lllumina draft data, which provides an aver- 
age 1,111 x coverage of the genome. 
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Genome annotation 

Genes were identified using Prodigal [64] as part 
of the DOE-JGI genome annotation pipeline [65], 
followed by a round of manual curation using the 
JGI GenePRIMP pipeline [66]. The predicted CDSs 
were translated and used to search the National 
Center for Biotechnology Information (NCBI) 
nonredundant database, UniProt, TIGR-Fam, Pfam, 
PRIAM, KEGG, COG, and InterPro databases. Addi- 
tional gene prediction analysis and functional an- 
notation were performed within the Integrated 
Microbial Genomes - Expert Review (IMG-ER) 
platform [56]. 



Genome properties 

The genome statistics are provided in Table 3 and 
Figure 3. The genome consists of six scaffolds with 
a total length of 4,130,897 bp and a G+C content of 
60.0%. The scaffolds correspond to a chromosome 
3,669,861 bp in length and four extrachromosomal 
elements as identified by their replication systems 
(see below]. Of the 3,986 genes predicted, 3,923 
were protein-coding genes, and 63 RNAs; 39 
pseudogenes were also identified. The majority of 
the protein-coding genes (81.0%) were assigned a 
putative function while the remaining ones were 
annotated as hypothetical proteins. The distribu- 
tion of genes into COGs functional categories is pre- 
sented in Table 4. 



Table 3. Genome Statistics 


Attribute 


Value 


% ot total 


Genome size (bp) 


a 1 on q n~7 

4. 1 jU.oy/ 


1 nn r\ r\o/ 
I 00.00 /o 


DNA coding region (bp) 


3. 683.922 


QO 1 QO/ 

8y. I 8 /o 


DNA G+C content (bp) 


2.479.086 


60.02% 


Number of scaffolds 


6* 




Extrachromosomal elements 


4 




Total genes 


3.986 


100.00% 


RNA genes 


63 


1.58% 


rRNA genes 


5* 


0.13% 


rRNA operons 


1* 




Protein-coding genes 


3.923 


98.42% 


Pseudo genes 


39 


0.98% 


Genes with function prediction 


3.228 


80.98% 


Genes in paralog clusters 


1.234 


30.96% 


Genes assigned to COGs 


3.178 


79.73% 


Genes assigned Pfam domains 


3.395 


85.17% 


Genes with signal peptides 


306 


7.68% 


Genes with transmembrane helices 


841 


21.10% 


CRISPR repeats 


0 





*An updated version of the genome assembly (unpublished data) 
reveals the presence of four rRNA operons. In this version, two of 
the rRNA operons are incomplete, with the 16S rRNA gene and 
the 5S rRNA gene hidden in the two remaining sequencing gaps. 
The genomes of P. inhibens strain DSM 24588 (2.10) and P. 
inhibens strain DSM 17395 (see Figure 3) also contain four rRNA 
operons. 
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QS- system II 
phage 2 





paaZ2 gene 

Ha dependent 
transporter 

tdaABCDCF cluster 



Toxin/ Antitoxin 



NRPSynthetase 
dltA gene cluster 



type IV secretion system 



QS-system I 



NRPSynthetase 
Polyketide synthase, 



pro-phage 1 
QS-system III 



NRPSynthetase 



siderophore 
gene cluster. 






nitrite reductase 

nitric oxide 
reductase cluster 



ABC sugar transporter 



Nitrous oxide 
reductase cluster 



type I secretion system 



glycosyltransferase-like protein 



Figure 3. Graphical representation of the genome of P. inhibens T5 T . From outside to the center: (1) se- 
quence of P. inhibens T5 T , (2) results of a blastn comparison from P. inhibens DSM 24588 (2.10) against P. 
inhibens T5 T , (3) results of a blastn comparison of P. inhibens DSM 17395 against P. inhibens T5 T , (4) G+C 
content. Comparisons and visualization are done with BRIG [67]. 



Insights into the genome 

Genome sequencing of P. inhibens DSM 16374 T 
revealed the presence of four extrachromosomal 
elements with sizes of 227 kb, 88 kb, 78 kb, and 69 
kb (Figure 3; Table 5) and DnaA-like I, RepABC-8, 
RepB-I and RepA-I as replication systems, respec- 
tively [68]. The different replicases that mediate 
the initiation of replication are designated accord- 
ing to the established plasmid classification 
scheme [69]. With the exception of the 88 kb re- 
plicon, these extrachromosomal elements are 
highly syntenic to specific replicons in the ge- 
nomes of P. inhibens strains DSM 17395 and DSM 
24588 (Figure 3). 

The locus tags of all replicases, plasmid stability 
modules and the large virB4 gene of a type IV se- 
cretion system are presented in Table 6. The 



plasmids pInhi_A227 and pInhi_B88 contain 
postsegregational killing systems (PSK) consisting 
of a typical operon with two small genes encoding 
a stable toxin and an unstable antitoxin [70]. 
Moreover, plasmid pInhi_B88 also contains a 
complete virB gene cluster of type IV secretion 
system, required for the formation of a 
transmembrane channel. However, the absence of 
the relaxase VirD2, which is necessary for the 
strand-specific DNA nicking at the origin of trans- 
fer [oriT], and the coupling protein VirD4 indi- 
cates that this plasmid is non-conjugative [71,72]. 
The RepA-I type replicon pInhi_D69 contains a 
complete rhamnose operon [73] and is dominated 
by genes required for polysaccharide biosynthesis. 



340 



Standards in Genomic Sciences 



Dogs et al. 



Table 4. Number of genes associated with the general COG functional categories 



Code 


Value 


%age 


Description 


J 


170 


4.87 


Translation, ribosomal structure and biogenesis 


A 


1 


0.03 


RNA processing and modification 


K 


2 74 


7.85 


Transcription 


L 


1 37 


3.92 


Replication, recombination and repair 


B 


3 


0.09 


Chromatin structure and dynamics 


D 


29 


0.83 


Cell cycle control, cell division, chromosome partitioning 


Y 


n. a. 


n. a. 


Nuclear structure 


V 


45 


1.29 


Defense mechanisms 


T 


156 


4.47 


Signal transduction mechanisms 


M 


199 


5.70 


Cell wall/membrane biogenesis 


N 


52 


1.49 


Cell motility 


Z 


n. a. 


n. a. 


Cytoskeleton 


w 


n. a. 


n. a. 


Extracellular structures 


u 


60 


1.72 


I ntracel I ular trafficki ng and secretion, and vesicular transport 


o 


127 


3.64 


Posttranslational modification, protein turnover, chaperones 


c 


204 


5.84 


Energy production and conversion 


G 


197 


5.64 


Carbohydrate transport and metabolism 


E 


425 


12.17 


Amino acid transport and metabolism 


F 


79 


2.26 


Nucleotide transport and metabolism 


H 


152 


4.35 


Coenzyme transport and metabolism 


1 


146 


4.18 


Lipid transport and metabolism 


P 


174 


4.98 


Inorganic ion transport and metabolism 


Q 


110 


3.15 


Secondary metabolites biosynthesis, transport and catabolism 


R 


446 


12.77 


General function prediction only 


S 


306 


8.76 


Function unknown 




808 


20.27 


Not in COGs 
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Table 5. General genomic features of the chromosome and extrachromosomal re- 
plicons of Phaeobacter inhibens strain DSM 16374 T 



Replicon 


Scaffold 


Length (bp) 


GC (%) 


Topology 


No. Genes* 


clnhi_A3361 


1 


3,361,358 


60 


linear* 


3,282 


clnhi_B309 


2 


308,503 


61 


linear* 


288 


plnhi_A22 7 


3 


22 6,687 


59 


linear* 


208 


nlnhi R88 


4 


87,579 


58 


I i near* 


93 


plnhi_C78 


5 


78,203 


63 


linear* 


62 


plnhi_D69 


6 


68,567 


63 


linear* 


53 



'deduced from automatic annotation 
*circularity not experimentally validated. 



Table 6. Integrated Microbial Genome (IMG) locus tags of P. inhibens DSM 1 6374 T genes for the ini- 
tiation of replication, toxin/antitoxin modules and two representatives of type IV secretion systems 

( T4SS) that are required for conjugation. 

Replication Plasmid Type IV Replication Plasmid 
Replicon Initiation Stability Secretion Replicon Initiation Stability 

Replicase Locus Tag Toxin Antitoxin VirB4 VirD4 



clnhi_A3361 DnaA lnhi_2434 



clnhi_B309 



plnhi_A227 DnaA-like I lnhi_3576 Inhi _3735 Inhi _3734 



plnhi_B88 RepC-8 lnhi_3797 Inhi _3865 Inhi _3866 Inhi _3845 



plnhi_C78 RepB-l lnhi_3883 



plnhi_D69 RepA-l Inhi 3972 
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As already indicated by the strong inhibitory ac- 
tivity of P. inhibens T5 T [8] all 26 described genes 
involved in the production of TDA are present in 
the genome of this strain. As found for the P. 
inhibens strains DSM 17395 and DSM 24588, the 
key genes for TDA production tdaABCDEF 
(Inhi_3684 - _3688, Inhi_3701), paaZ2 
(Inhi_3702) and a gene coding for a putative Na- 
dependent transporter (Inhi_3697) [3,74] are lo- 
cated on the 227 kb plasmid of T5 T (Figure 3). The 
remaining 19 genes, containing genes of the 
phenylacetyl-CoA and assimilatory sulfate reduc- 
tion pathways, are scattered over the chromo- 
some as in the strains DSM 17395 and DSM 24588 
[3]. Beside the tdaA gene, present on the 227 kb 
plasmid, we also found other genes involved in the 
regulation of TDA synthesis located on the chro- 
mosome, what is in agreement with Thole et al. 
(2012) and Berger et al. (2012) [3,75]. This in- 
cludes the genes encoding transcriptional activa- 
tor proteins (Inhi_2121; _2059; _0396) compara- 
ble with pgaR, iorR a transcriptional regulator 
(PGAl_c20730), a putative serine-protein kinase 
(Inhi_2265) and a putative signal peptide pepti- 
dase (Inhi_2227). 

Two complete prophages and an additional cluster 
coding for the production of gene transfer agents 
(GTA) were found in the genome of strain T5 T . The 
GTA gene cluster is equal in length and comprises 
the same genes (Inhi_0654 - Inhi_0670) as the 
GTA clusters of the strains DSM 17395 and DSM 
24588. The two prophages of strain T5 T consist of 
52 ORFs (prophage 1; ~37kb) and 63 ORFs 
(prophage 2; ~48kb), respectively. Strain DSM 
17395 possesses two prophages, but for DSM 
24588 no prophages were detected [3]. Prophage 
1 of strain T5 T is similar to prophage 1 of strain 
DSM 17395, with the exception that a few ORFs 
are different (PGAl_cl8280 - _cl8310, 
PGAl_cl8480 - _cl8530 and PGAl_cl8570 - 
_cl8680; Inhi_1777, Inhi_1785 - _1788, Inhi_1803 
- _1812 and Inhi_1816 - 1829). Prophage 2 of 
strain T5 T is a Mu-like bacteriophage, not present 
in strain DSM 17395. 

It was previously shown that strain T5 T produces 
two different AHLs, i.e. Gs-en-HSL and N-3- 
hydroxydecanoyl-homoserine lactone (30HGo- 
HSL) [76]. In P. inhibens strain DSM 17395 TDA 
and pigment production are regulated via a pgaR- 
pgal QS system [47]. The AHL synthase encoding 
gene pgal in DSM 17395 is responsible for the 
production of 30HGo-HSL. In the genome of strain 



T5 T we found a homologous system probably cod- 
ing for the 30HCio-HSL producing AHL synthase 
(Inhi_2120, homolog to pgal) and the respective 
regulator (Inhi_2121, homologous to pgaR) (Fig- 
ure 3, QS system I). Thus TDA production in strain 
T5 T might also be regulated by a QS system. In ad- 
dition, two further QS systems (QS system II and 
III; Figure 3) were found on the chromosome of 
T5 T . System II is formed by the genes Inhi_0506 
and _0507 and is located in the prophage region 2. 
Orthologs for these QS system genes are also pre- 
sent in P. inhibens strain DSM 24588 
(PGA2_cl8960 and PGA2_cl8970) but absent in 
strain DSM 17395. QS system III consists of the 
genes Inhi_1819 and _1820 and is unique for 
strain T5 T compared to P. inhibens DSM 17395 and 
DSM 24588. It is also located in the potential 
prophage 1 region (Fig. 3). A homologous system 
was found in the genome of Phaeobacter caeruleus 
DSM 24564 T and the neighboring genes show a 
high synteny. The location in the prophage region 
and the high synteny to the system of P. caeruleus 
suggest a possible gene transfer of this QS system 
via a bacteriophage. The functions of QS system II 
and III are currently unknown, but it is likely that 
the compound Gs-en-HSL is produced by one of 
those systems. 

Two functions were suggested that can possibly 
be used as unique chemotaxonomic markers for 
the species P. inhibens within the Roseobacter 
clade [3]. The genes coding for the first of these 
functions are located on the chromosome and are 
involved in cell wall development and surface at- 
tachment [dltA encoding a D-alanine- 
poly(phosphoribitol) ligase involved in biosynthe- 
sis of D-alanyl-lipoteichoic acid]. The second 
unique function is the biosynthesis and transport 
of iron-chelating siderophores, and the encoding 
genes are located on the plasmid pPGAl_78 and 
pPGA2_95, respectively. These two clusters are 
also present in the genome of strain T5 T . The 
siderophore gene cluster (Inhi_3924 - Inhi_3928) 
is located on the 78 kb plasmid (Fig. 3) and the 
dltA gene cluster (Inhi_1065 - Inhi_1086) is locat- 
ed on the chromosome (Fig. 3). Screenings in the 
newly available Roseobacter genomes showed that 
Leisingera methylohalidivorans DSM 14336 [42] 
and Leisingera aquimarina DSM 24565 [41] also 
harbor the genes for siderophore synthesis. The 
uniqueness of the dltA gene cluster within the 
species P. inhibens, however, remains and can be 
used as chemotaxonomic marker. 
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The existence of genes coding for a polyketide 
synthase (Inhi_1972) and three non-ribosomal 
peptide synthetases (Inhi_1072, _1974 and _3983) 
confirm the results of Martens et al. (2007) [7]. 
These genes are present in the genomes of strains 
DSM 17395 and DSM 24588, too (PGAl_c04930 
and PGAl_c05350, _cl3760, _c28490; 
PGA2_c05370 and PGA2_c04910, _c 13660, 
_71pll0). The genes Inhi_3983 of P. inhibens 
strain T5t and PGA2_71pllO of P. inhibens strain 
DSM 24588 are located on the 69 kb plasmid (Fig 
3) and 71 kb plasmid, respectively. In contrast, the 
homologous gene (PGAl_c28490) of P. inhibens 
strain DSM 17395 is located on the chromosome. 

For the P. inhibens strains DSM 17395 and DSM 
24588 a surface-attached lifestyle was inferred 
from the genome analysis [3]. Even though strain 
T5 T was isolated from a water sample, it exhibits 
the same genes associated with the biosynthesis 
and transport of polysaccharides as strains DSM 
17395 and DSM 24588. This includes genes de- 
scribed as unique for the strains DSM 17395 and 
DSM 24588, i.e. a gene coding for a 
glycosyltransferase-like protein (Inhi_3961) and 
two ORFs (Inhi_3954 and Inhi_3955) related to a 
type I secretion system and used for the transport 
of exopolysaccharides. Production of extracellular 
polysaccharides is a major factor contributing to 
surface attachment [77,78]. Thus it appears likely 
that T5 T is also well-adapted to a surface attached 
lifestyle. 

P. inhibens was described as a strictly aerobic 
bacterium [1]. However, we found genes involved 
in the dissimilatory nitrate reduction pathway to 
nitrogen, including the gene coding for a copper 
containing nitrite reductase (Inhi_3645) and a 
nitric oxide reductase cluster (Inhi_3648 - 
Inhi_3654), both located on the replicon 
pInhi_A227. These genes are also present and 
located on the largest plasmids of P. inhibens 
DSM 17395 (PGAl_262p) and P. inhibens DSM 
24588 (PGA2_239p) (Figure 3). In addition, P. 
inhibens strain T5 T possesses a gene cluster cod- 
ing for a nitrous oxide reductase (Inhi_3786 - 
Inhi_3792) located on the replicon pInhi_B88, 
which is absent in the strains DSM 17395 and 
DSM 24588 (Figure 3). Neither strain T5t nor 
DSM 17395 and DSM 24588 have genes coding 
for a nitrate reductase. The findings suggest that 
P. inhibens T5 T has a complete dissimilatory ni- 
trite reduction pathway, but is not able to reduce 
nitrate, as previously described by Martens etal. 
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(2006) [1]. To confirm the results we tested 
strain T5 T for its capability to grow anaerobically 
with nitrite. Anaerobic marine basal medium was 
prepared according to Cypionka and Pfennig 
(1986) [79] and supplemented with nitrite and 
glucose, both in a final concentration of 5 mM. 
After two weeks a decrease of nitrite was deter- 
mined by photometric analysis at 545 nm by us- 
ing the Griess reaction [80] and an increase of the 
turbidity was detected (results not shown). Thus 
it became clear that P. inhibens T5 T is able to 
grow anaerobically with nitrite, suggesting an 
emended description of this organism as a 
facultatively anaerobic bacterium. 

Phylogenetic analysis shows that P. inhibens and 
P. gallaeciensis form a cluster together with 
Phaeobacter arcticus (Figure 1). The cluster is set 
apart from the cluster comprising Leisingera 
aquimarina, Leisnigera nanhaiensis, Leisingera 
methylohalidivorans, Phaeobacter caeruleus and 
Phaeobacter daeponensis, but the backbone of the 
16S rRNA gene tree shown in Figure 1 is rather 
unresolved. Using the online analysis tool "Ge- 
nome-to- Genome Distance Calculator" 2.0 (GGDC) 
[81,82], we performed a preliminary phylogenet- 
ic analysis of the draft genomes of the type 
strains of the genera Leisingera and Phaeobacter 
and the finished genomes of P. inhibens strains 
DSM 17395 and DSM 24588. Table 7 shows the 
results of the in silico calculated DNA-DNA hy- 
bridization (DDH) similarities of P. inhibens to 
other Phaeobacter and Leisingera species. In the 
following analysis, we will refer only to the re- 
sults of formula 2, as this formula is robust 
against the use of draft genomes such as 
AOQA01000000 (CIP 1052 10 T ) [83]. The use of 
GGDC revealed a high similarity of T5t (78%) to 
the strains P. inhibens DSM 17395 and DSM 
24588, but a low similarity to P. gallaeciensis 
strain CIP 1052 10* (36%). DSM 17395 and CIP 
10521 0 T were previously supposed to be type- 
strain deposits for P. gallaeciensis [33] and we 
cross-compared them using GGDC. Formula 2 
yielded a similarity of only 38.30% ± 2.50 be- 
tween these two strains, thus indicating not only 
that they are not the same strain, but also do not 
even belong to the same species. The results are 
in agreement with the study of Buddruhs et al. 
(2013) [15] showing that strain DSM 17395 is 
the false deposit and belongs together with DSM 
24588 to P. inhibens, whereas CIP 10521 0t is the 
correct type-strain deposit for P. gallaeciensis. 
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The differences in the G+C content (55.7%) pub- 
lished earlier [1] and the value calculated directly 
from the genome (Table 3) warrants an update of 
the taxonomic description on P. inhibens [84]. 



Moreover, genomic and experimental evidence 
indicates that P. inhibens is not strictly aerobic but 
facultatively anaerobic. 



Table 7. Digital DDH similarities between P. inhibens T5 T and the other Phaeobacter and Leisingera spe- 
cies (including the genome-sequenced type strains and P. inhibens strains DSM 17395 and DSM 24588 
[2,10]) calculated in silico with the GGDC server version 2.0 [83] + . 



Reference strain (type strain unless indicated) 


formula 1 


formula 2 


formula 3 


P. arcticus DSM 23566 T (AXBF00000000) 


17.60+3.30 


21.70+2.44 


17.80+2.95 


P. caeruleus DSM 24564 T (AXBI00000000) 


19.00+3.36 


20.60+2.46 


19.10+2.99 


P. daeponensis DSM 23529 (AX B DO 00 000 00) 


20.40+3.41 


22 .60+2 .46 


20.20+3.03 


P. gallaeciens is CIP 1 052 1 0 T (AOQA01 000000) 


78.40+3.76 


36.20+2.57 


68.50+3.52 


P. inhibens DSM 17395 (CP002976, CP002977, 
CP002978, CP002979) 


yu.ou±z . / O 


/ O.jU+Z . 3>0 




P. inhibens DSM 24588 (2.10) (CP002972, 
CP002973, CP002974, CP002975) 


94.50+2.03 


78.40±2.98 


94.20+1.92 


L aquimarina DSM 24565 T (AX BE 000 000 00) 


18.50+3.34 


22.50+2.45 


18.70+2.98 


L methylohalidivorans DSM 1 4336 T (CP006773, 
CP006774, CP006775) 


19.40+3.38 


22.50+2.45 


19.40+3.01 


L nanhaiensis DSM 24252 T (AXBG00000000) 


14.00+3.08 


21.00+2.42 


14.70+2.78 



f The standard deviations indicate the inherent uncertainty in estimating DDH values from intergenomic 
distances based on models derived from empirical test data sets (which are always limited in size); see 
[83] for details. The distance formulas are explained in [82]; formula 2 is recommended, particularly for 
draft genomes (such as AOQA01 000000). The numbers in parentheses are GenBank accession numbers 
identifying the underlying genome sequences. 



Conclusion 

Emended description of the species Phaeobacter 
inhibens Martens et al. 2006 

The description of the species Phaeobacter 
inhibens is the one given by Martens et al. 2006 
[1], with the following modification. The G+C con- 



tent, rounded to zero decimal places, is 60%. 
Phaeobacter inhibens is a facultative anaerobic 
bacterium by using nitrite reduction. 
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